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On the Sizes of DPDAs, PDAs, LBAs 

by Richard Beigel and William Gasarch 


Abstract 

There are languages A such that there is a Pushdown Automata (PDA) that recognizes A 
which is much smaller than any Deterministic Pushdown Automata (DPDA) that recognizes 
A. There are languages A such that there is a Linear Bounded Automata (Linear Space Tur¬ 
ing Machine, henceforth LBA) that recognizes A which is much smaller than any PDA that 
recognizes A. There are languages A such that both A and A are recognizable by a PDA, but 
the PDA for A is much smaller than the PDA for A. There are languages A \, A 2 such that 
A\,A2,A\ n A2 are recognizable by a PDA, but the PDA for A\ and A2 are much smaller 
than the PDA for A\ n Ay. We investigate these phenomena and show that, in all these cases, 
the size difference is captured by a function whose Turing degree is on the second level of the 
arithmetic hierarchy. 

Our theorems lead to infinitely-many-n results. For example: for-infinitely-many-n there 
exists a language A n recognized by a DPDA such that there is a small PDA for A n , but any 
DPDA for A n is very large. We look at cases where we can get all-but-a-finite-number-of-n 
results, though with much smaller size differences. 

Keywords: Pushdown Automata; Context Free Languages; Linear Bounded Automata; length of 
description of languages 

1 Introduction 

Let DPDA be the set of Deterministic Push Down Automata, PDA be the set of Push Down Au¬ 
tomata, and LBA be the set of Linear Bounded Automata (usually called nondeterministic linear- 
space bounded Turing Machines). Let L(DPDA) be the set of languages recognized by DPDAs 
(similar for L(PDA) and L(LBA)). It is well known that 
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L(DPDA) C L(PDA) C L( LBA). 


Our concern is with the size of the DPDA, PDA, LBA. For example, let A e L(DPDA). Is it 
possible that there is a PDA for A that is much smaller than any DPDA for A? For all adjacent 
pairs above we will consider these questions. There have been related results by Valiant [fT6l . 
Schmidt lfl2l . Meyer and Fischer [fill . Hartmanis [0, and Hay 0. We give more details on their 
results later. 

Throughout the paper E is a finite alphabet and $ is a symbol that is not in E. All of our 
languages will either be subsets of E* or (E U {$})*. 

Convention 1.1 A device will either be a recognizer (e.g., a DFA) or a generator (e.g., a regular 
expression). We will use Ad to denote a set of devices (e.g., DFAs). We will refer to an element of 
Ad as an Ad-device. If P is an Ad-device then let L(P ) be the language recognized or generated 
by P. Let L(M) = {L(P) : P G Ad}. 

Def 1.2 Let Ad and Ad' be two sets of devices such that L(Ad) C L(Ad'). (e.g., DFAs and 
DPDAs). A bounding function for (Ad, Ad') is a function / such that for all A e L(Ad), if 
A E L(Ad') via a device of size n then A e L(M) via a device of size < f(n). 

Def 1.3 

1. The size of a DFA or NDFA is its number of states. 

2. The size of a DPDA or PDA is the sum of its number of states and its number of symbols in 
the stack alphabet. 

3. The size of a CFG or CSL is its number of nonterminals. 
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4. The size of an LBA is the sum of its number of states and its number of symbols in the 
alphabet (note that the alphabet used by the Turing machine may be bigger than the input 
alphabet). 

We now give some examples and known results. 

Example 1.4 Known Upper Bounds: 

1. f(n) = 2 n is a bounding function for (DFA,NDFA) by the standard proof that L(NDFA) C 
L(DFA). 

n O(n) 

2. f(n) = n n is a bounding function for (DFA,DPDA). This is a sophisticated construction 

by Stearns lfl4l . 

3. f(n) = 2 2 ° {n) is a bounding function for (DFA.DPDA). This is a sophisticated construction 
by Valiant lfl8ll . Note that this is a strict improvement over the construction of Stearns. 

4. f(n) = 0(n 0(l) ) is a bounding function for (CFG,PDA). This can be obtained by an inspec¬ 
tion of the proof that L(PDA) C L(CFG). 

5. f(n) = 0(n) is a bounding function for (PDA,CFG). This can be obtained by an inspection 
of the proof that L(CFG) C L(PDA). 

6. f(n ) = 0(n) is a bounding function for (CSG,LBA). This can be obtained by an inspection 
of the proof that L(LBA) C L(CSG). 

7. f(n) = 0(n ) is a bounding function for (LBA,CSG). This can be obtained by an inspection 
of the proof that L(CSG) C L(LBA). 


Example 1.5 Known Lower Bounds: 
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1. Meyer and Fischer liTTTl proved that (1) If / is the bounding function for (DFA,NDFA) then 
2 n < f(n). (2) If / is the bounding function for (DFA,DPDA) then 2 2 ° (n) < f(n). (3) If / 
is the bounding function for (DFA,CFG) then HALT <t /. The sets they used for (3) were 
finite. 

2. Let UCFG be the set all unambiguous context free grammars. Valiant [fl6l showed that if / 
is the bounding function for (DPDA,UCFG) then HALT <t f ■ 

3. Schmidt Ifl2l showed that if / is the bounding function for (UCFG,CFG) then HALT < T /. 

4. Hartmanis [j6] showed that if / is the bounding function for (DPDA,PDA) then HALT < T 
/• 

5. Harel and Hirst ([[ 51 , Proposition 14 and Corollary 15 of the journal version, (Proposition 12 
and Corollary 13 of the conference version) have shown that if g is computable then g is not 
a bounding function for (DPDA,PDA) in a very strong way. They showed that for all n > 0 
there is a language L n such that (1) there is a PDA for L n of size 0(n), but (2) any DPDA 
for L n requires size at least g(n). 

6. Hay [O showed that if / is the bounding function for (DPDA,PDA) then HALT < T /. She 
also showed that there is a bounding function / for (DPDA,PDA) such that / < T INF. 
(INF is the set of all indices of Turing machines that halt on an infinite number of inputs. 
It is complete for the second level of the arithmetic hierarchy and hence strictly harder than 
HALT.) 

7. Gruber et al. [jH proved several general theorems about sizes of languages. Hay’s result 
above is a corollary of their theorem. 

Note 1.6 The results above that conclude HALT <t / were not stated that way in the original 
papers. They were stated as either / is not recursive or / is not recursively bounded. However, an 
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inspection of their proofs yields that they actually proved HALT < T /. 

Def 1.7 Let Ad be a set of devices. A c-bounding function for Ad is a function / such that for all 
A that are recognized by an Ad-device of size n, if A G L ( Ad) then it is recognized by an Ad - 
device, of size < f{n). One linguistic issue— we will write (for example) c-bounding function for 
PDAs rather than c-bounding function for PDA since the former flows better verbally. 

We now give some examples and known results. 

Example 1.8 

1. f(n) = 2 n is a c-bounding function for NDFAs. This uses the standard proofs that L(NDFA) C 
L(DFA) and that L(DFA) is closed under complementation. 

2. f(n) = 0(n) is a c-bounding function for DPDAs. This is an easy exercise in formal 
language theory. 

3. f(n) — 0(n ) is a c-bounding function for LBAs. This can be obtained by an inspection of 
the proof, by Immerman-Szelepcsenyi [|8][T53, that nondeterministic linear space is closed 
under complementation. 

Def 1.9 Let Ad be a set of devices. An i-bounding function for Ad is a function / such that for all 
Ai, A 2 that are recognized by an Ad-device of size n, if A 1 (T A 2 G L(Ad) then it is recognized 
by an Ad- device, of size < f(n). One linguistic issue— we will write (for example) i-bounding 
function for PDAs rather than i-bounding function for PDA since the former flows better verbally. 

Example 1.10 

1. f{n) = 2 n is an i-bounding function for DFA. This uses the standard proofs that L(DFA) is 
closed under intersection. 
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2. f(ri) = 2 2n is an i-bounding function for NDFAs. Convert both NDFAs to DFAs and then 
use the standard proof that L(DFA) is closed under intersection. 

Note 1.11 We will state our results in terms of DPDAs, PDAs, and LBAs. Hence you may read 
expressions like L(PDA) and think isn’t that just CFLs? It is. We do this to cut down on the 
number of terms this paper refers to. 

2 Facts and Notation 

We will need the following notation and facts to state our results. We will prove the last item since 
it seems to not be as well known as the others. 

Fact 2.1 

1. Mo, Mi, M 2 ,... is a standard numbering of all deterministic Turing Machines. 

2. M f:fl (x) is the result of running M e (x)for s steps. 

3. HALT is the set {(e,x) : (3 s)[M CtS (x) halts }. HALT is Ti-complete. Hence any 3 
question can be phrased as a query to HALT. Note that any (V) question can also be 
phrased as a query; however, you will have to negate the answer. 

4. INF is the set {e : (Vx)(3 y, s)[y > x A M e>s (y) halts ]}. INF is n 2 - complete. Hence any 
(V)(3) question can be phrased as a query to INF. Note that any (3)(V) question can also 
be phrased as a query; however, you will have to negate the answer. 

5. A <t B means that A is decidable given complete access to set B. This can be defined 
formally with oracle Turing machines. 

6. f <t HALT iff there exists a computable g such that, for all n, f(n ) = lim,,^,*, g(n, s). 
We can take g to have complexity 0(log(n + s)), or even lower. This result is due to Shoen- 
field ff3\l and is referred to as The Shoenfield Limit Lemma. It is in most computability 
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theory books. Note that the domain f is N, the domain of g is N x N, and the codomain of 
both f and g is N. Hence the 1 mig-^ g(n, s ) means that (3s 0 , x)(\/s > s 0 )[g(n, s ) = a:]. 

Proof: We just prove Part 6. 

Let HALT S = {(e,x) : [M e ^{x) halts }. 

Assume / <t HALT via oracle Turing machine M-\ Hence, for all n, Mf 1ALT (n) halts and 
is equal to f(n). Since M HALT (n) halts it does so in a finite amount of time and using only a finite 
number of queries to HALT. Hence there exists s such that for all t > ». = M5 ALT '(n). 

Therefore the following computable function g works. 

{ 0 if M^ ALTs (n) has not converged 

( 1 ) 

M» ALTs (n) if it has converged 

We can obtain a g of very low complexity by (for example) having g(n, s ) computer AT lg * s lg s 
instead of M* ALTa 

Assume there exists a computable g such that, for all n, f(n ) = lim., g{n, s). The following 
algorithm shows / <t HALT. 

ALGORITHM 

Input(n) 

For s 0 = 1 to oo (we will show that this terminates) 

Ask HALT (Vs > s 0 )[g(n, s) = g(n, s + 1)]. 

If YES then output g(n , s) and STOP. 

END OF ALGORITHM 

Since lim^oo g(n, s ) exists there will be an s 0 such that the answer to the HALT question is 
YES. Hence the algorithm terminates. 

I 
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3 Summary of Results 


In this section we summarize our results. We also present them in a table at the end of this section. 

The results of Hartmanis [[6] and Hay 0 mentioned in Exercise Q3] above leave open the exact 
Turing degree of the bounding function for (DPDA,PDA). In Section[4]we resolve this question by 
proving a general theorem from which we obtain the following: 

1. If / is a bounding function for (DPDA,PDA) then INF <t /. 

2. There exists a bounding function for (DPDA,PDA) such that / <t INF. (Hay 0 essen¬ 
tially proved this; however, we restate and reprove in our terms.) 

3. If INF / then for infinitely many n there exists a language A n £ L(DPDA) such that 
(1) any DPDA that recognizes A n requires size > f(n), (2) there is a PDA of size < n that 
recognizes A n . (This follows from Part 1.) 

4. If / is a bounding function for (PDA,LBA) then INF <t /. 

5. There exists a bounding function for (PDA,LB A) such that / <t INF. 

6. If INF / then for infinitely many n there exists a language A n E L(PDA) such that 
(1) any PDA that recognizes A n requires size > f(n), (2) there is an LB A of size < n that 
recognizes A n . (This follows from Part 4.) 

In Section |5] and [6] we find the exact Turing degree of the c-bounding function and the i- 
bounding function for PDAs. We obtain the following: 

1. If / is a c-bounding function for PDAs then INF < T /. 

2. There exists a c-bounding function for PDAs such that / <t INF. 
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3. If INF / then for infinitely many n there exists a language A n such that (1 ) A n . A n £ 
L(PDA), (2) there is no PDA of size < f(n) for A n , but (3) there is a PDA of size < n for 
A n . (This follows from Part 1.) 

4. Results 1,2,3 but with i-bounding functions instead of c-bounding functions. 

Note that we have several results of the form for infinitely many n ... that use unnatural lan¬ 
guages. We would like to have for all but finitely many n ... results that involve natural languages. 
We need the following definitions. 

Def 3.1 Let A(n) be a statement about the natural number n. A(n) is true for almost all n means 
that A(n) is true for all but a finite number of n. 

Def 3.2 (Informal) A language is unnatural if it exists for the sole point of proving a theorem. 

Example 3.3 

1. Languages that involve Turing configurations are not natural. 

2. Languages created by diagonalization are not natural. 

3. The language {ww : \w\ — n} is natural. 

Note 3.4 We will sometimes state theorems as follows: there exists a (natural) language such that 
.... If we do not state it that way then the language is unnatural. 

In Sections [7|we obtain the following/or almost all n result^!- 
For almost all n there exists a (natural) language A n € L(DPDA) such that 

1. Any DPDA for A n requires size > . 

'Harel and HirstQ obtained these independently 21 years ago. We comment on how their proofs and our proofs 
differ in Section [7] 
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2. There is a PDA of size 0{n ) that recognizes A n . 


Note 3.5 As noted in Example 1 1.51 5 Harel and Hirst 0 have a stronger result; however, the lan¬ 
guage they uses is not natural. 

Section [7] also has the following result. In fact, we prove this result first and derive the result 
above from it. 

For almost all n there exists a (natural) language A n such that 

1. A n , A n e L(PDA). 

2. Any PDA for A n requires size > A 2 ” !,n . 

3. There is a PDA of size 0(n) that recognizes A n . 

In Section [8] we show the following: 

For almost all n there exists a (natural) language A n e L(PDA) such that 

1. Any PDA for A n requires size > ‘'A- 1 '". 

2. There is an FBA of size 0(n ) that recognizes A n . 

In Section|9]we obtaiij] a for almost all n result for (PDA,FBA): 

Fet / be any function such that / < T HALT. For almost all n there exists a finite language A n 
such that 

1. Any PDA for A n requires size > f(n). 

2. There is an FBA of size 0(n) that recognizes A n . 

2 Meyer originally claimed this result. See the discussion in Section[9] 
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We summarize our results in the following tables. 


The first table: The first two columns, JA and AT, indicate that we are looking at the bounding 
functions / for (Af, AT). The third column yields a property that any such / must have. The 
fourth column states whether we get as a corollary to the proof a result about for-infinitely-many 
n (io) or for-almost-all-n (ae). The fifth column states if the sets involved are natural or not. The 
sixth column states if the condition on / from the third column captures exactly the Turing degree 
of the bounding function. For example, in the first column we see that any bounding function / for 
(DPDA, PDA) has to satisfy INF <t /; however, the YES in column six indicates that there is 
a bounding function of this Turing degree. 


M 

AT 

(V/) 

io/ae 

Nat? 

Exact TD? 

DPDA 

PDA 

INF < T / 

io 

NO 

YES 

PDA 

LBA 

INF < T / 

io 

NO 

YES 

DPDA 

PDA 

y > 2 2fl(n) 

ae 

YES 

NO 

PDA 

LBA 

y > 2 2fl(n) 

ae 

YES 

NO 

PDA 

LBA 

f HALT 

ae 

NO 

YES 


The second table: The first two columns are a set of devices, AT and an operation Op (either 
Complementation (c) or Intersection (i)). We are concerned with the Op-bounding functions / 
for Af. The third column yields a property that any such / must have. The fourth column states 
whether we get as a corollary to the proof a result about for-infinitely-many n (io) or for-almost- 
all-n (ae). The fifth column states if the sets involved are natural or not. The sixth column states 
if the condition on / from the third column captures exactly the Turing degree of the bounding 
function. 
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M 

Op 

(V/) 

io/ae 

Nat? 

Exact TD? 

PDA 

Complementation 

INF < T / 

io 

NO 

YES 

PDA 

Intersection 

INF < T / 

io 

NO 

YES 

PDA 

Complementation 

j ’ >> 

ae 

YES 

NO 


4 Bounding Functions for (DPDA,PDA) and (PDA,LBA) 

In this section we prove a general theorem about bounding functions and then apply it to both 
(DPDA,PDA) and (PDA,LB A). In both cases we show that the Turing degree of the bounding 
function is in the second level of the arithmetic hierarchy. 

We will need to deal just a bit with actual Turing Machines. 

Def 4.1 Let M be a deterministic Turing Machine. A configuration (henceforth config) of M is a 
string of the form a%/3 where a, f3 G £*, o e S, and q e Q. We interpret this as saying that the 
machine has ao/3 on the tape (with blanks to the left and right), is in state q, and the head is looking 
at the square where we put the %. Note that from the config one can determine if the machine has 
halted, and also, if not, what the next config is. 

Notation 4.2 If C is a string then C R is that string written backwards. For example, if C — aaba 
then C R = abaa. 

Def 4.3 Let e,x G N. We assume that any halting computation of M e takes an even number of 
steps. 

1. Let ACC e , x be the set of all sequences of config’s represented by 


d)L/ld)02 d) 03 d)L /4 U) * • • u) 


such that 
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• \c l \ = \c 2 \ = --- = \c s \. 

• The sequence C\ , C' 2 , • • •, C s represents an accepting computation of M e (x). 

2. Let ACC e = U xeN ACC e , x . 

Hartmanis proved the following lemma. 

Lemma 4.4 For all e,x, ACC e , x E L(PDA). For all e, ACC e E L(PDA). In both cases it is 
computable to take the parameters ((e,x) or e) and obtain the PDA. 

Def 4.5 Let M and M! be two sets of devices. 

1. M C AA! effectively if there is a computable function that will, given an A"!-device P, output 
an A4 '-device P' such that L(P ) = L(P'). 

2. M is effectively closed under complementation if there is a computable function that will, 
given an M. -device P, output an M. -device P 1 such that L(P') = L{P). 

3. The non-emptiness problem for A4 is the following: given an A4-device P determine if 

L(P) + t 

4. The membership problem for A4 is: given an A4-device P and x E E* determine if x E 

L(P). 

5. A4 is size-enumerable if there exists a list of devices P \, P 2 , ■ ■ ■ such that 

• M = {L(Pf) : i E N}, 

• (Vi)[|Pi| < |P i+ i|], and 

• the function from i to Pi is computable. 

Note that DFA, NDFA, DPDA, PDA, LBA are all size-enumerable, however UCFG is not. 
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Theorem 4.6 Let A4 and M' be two sets of devices such that the following hold. 

• L(M) C L(PDA) C L(M') effectively. 

• At least one of M., M.’ is effectively closed under complementation. 

• The non-emptiness problem for A4 is decidable. 

• The membership problems for A i and A4' are decidable. 

• Every finite set is in L(Ai). 

• M is size-enumerable. 

Then 

1. If f is a bounding function for (AA,AA') then HALT < T f. 

2. If f is a bounding function for (AA,AA') then INF <t /. 

3. There exists a bounding function f <t INF for (A4. A4 r ). 

4. If INF f then for infinitely many n there exists a language A n G I ( A4 ) such that (1) 

any A4-device that recognizes A n requires size > f{n), (2) there is an AA'-device of size 
< n that recognizes A. (This follows from Part 2 so we do not prove it.) 

Proof: 

1) If / is a bounding function for then HALT <t /. 

Note that 

• If M e (x) halts then ACC ejX has one string, which is the accepting computation of M e (x). 

• If M e (x) does not halt then ACC e:X = 0. 

• Given e, x one can construct a PDA for ACC £:X by Lemma l4~4l 
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We give the algorithm for HALT < T /. There will be two cases in it depending on which of 


Af or AT is effectively closed under complementation. 

ALGORITHM FOR HALT < T / 

1. Input(e,a;) 

2. Construct the PDA P for ACC f:XX . Obtain the device Q in AT that accepts ACC e _ x . 

3. Case 1: Af is effectively closed under complementation. Compute f(\Q\). Let D 1 ,.. ., D t 
be all of the Af-devices of size < fi\Q\). Create the A4 devices for their complements, 
which we denote E 1} ..., E t . 

Case 2: Af' is effectively closed under complementation. Find an AT- device R for L(Q) = 
ACC e ^ x . Compute f(\R\). Let E 1: ..., E t be all of the Af-devices of size < f(\R\). 

Note that at the end of step 3, regardless of which case happened, we have a set of Af-devices 
/■. i,,.., such that 

(e, x) e HALT iff 

(31 <i< t)[L(Ei ) is one string which represents an accepting computation of M e (A)]. 

4. For each 1 < i < t (1) determine if L ( E t ) = 0 (2) if L ( E .,) = 0 then let iv, be the empty 
string, and (3) if L(Ej) T 0 then, in lexicographical order, test strings for membership in 
L(Ei ) until you find a string in L(Ej) which we denote Wi. If {tui,..., contains a string 
representing an accepting computation of M e (x ) then output YES. If not then output NO. 

END OF ALGORITHM 

2) If / is a bounding function for (Af, Af') then INF < T /. 

Note that 
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• If e G INF then ACC e ^ L(PDA) since ACC e is infinite and every string in it begins with 
$C!$Cf$C 3 $ where |C' 1 | = \C?\ = \C 3 \. 

• If e ^ INF then ACC e G L(PDA) since ACC e is finite. 

• Given e one can construct a PDA for ACC e by Lemma l4~4l 

We give the algorithm for INF <t /. There will be two cases in it depending on which of M. 
or AT is effectively closed under complementation. 

In the algorithm below we freely use Fact 12.11 2 to phrase (3)-questions as queries to HALT , 
and Part 1 to answer queries to HALT with calls to /. 

ALGORITHM FOR INF < T / 

1. Input(e) 

2. Construct the PDA P for ACC e . Obtain the device 0 in AT that accepts ACC e . 

3. There are two cases. 

Case 1: M is effectively closed under complementation. Compute f(\Q\). Let Di,...,D t 
be all of the A"!-devices of size < f(\Q\). Create the A4 devices for their complements, 
which we denote E 1: ..., E t . 

Case 2: M.' is effectively closed under complementation. Find an M.'- device R for L{Q) = 
ACC e . Compute f(\R\). Let E 1 ,...,E t be all of the ALdevices of size < f(\R\). 

Note that at the end of step 3, regardless of which case happened, we have a set of Al-devices 
Ei,...,E t such that 

e g INF =► ACC e i L(PDA) =► ACC e £ L(M) =* ACC e £ {. L{E 1 ),... , L{E t )} 
=>■ (3xi,... ,x t )(Vl < i < t)[ACC e (xi ) 7 ^ Ei(xi)]. 
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e f INF =► ACC e is finite => G L(M) =* (31 < z < t)[L(£;) = ACC e ] 

=> -<(3xi, • • ■ ,xt)(Vl < i < t)[ACC e (xi ) 7 ^ ^(xj)]. 

4. Ask (3xi,..., x t )(Vl < i < t)[ACC e (xi) f Efxi)]. (Note that ACC e is decidable so this 
is a (3) question.) If YES then output YES. If NO then output NO. 

3) There exists a bounding function / <t INF for (Ni,Ai'). 

In the algorithm below we freely use Fact l2.11 3 to phrase (3) (V) -questions as queries to INF. 

Algorithm for / 

1. Input(n) 

2. MAX=0. 

3. For every M. '-device P of size < n do the following 

(a) Ask (3Al-device D)(Wx)[P(x) = D(x)}7 

(b) If YES then fori = 1,2,3,... ask (3Al-device D, \D\ = i)(\/x)[P(x) = D(x)]l 
until the answer is YES. 

(c) Let i be the value of % when the last step stopped. Note that (3D, \D\ — i)(Vx)[P(x) = 
D(x)]. If i > MAX then MAX = i. 

4. Output MAX. 

I 

Corollary 4.7 

1. If f is a bounding function for (DPDA,PDA) then INF <t /. 

2. There exists a bounding function for (DPDA,PDA) such that f <t INF. 
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3. If INF f-T f then for infinitely many n there exists a language A n G L(DPDA) such that 
(1) any DPDA that recognizes A n requires size > / (n) for A n , but (2) there is a PDA of size 
< n that recognizes A n . 

4. If f is a bounding function for (PDA,LBA) then INF <t /. 

5. There exists a bounding function for (PDA,LBA) such that f <t INF. 

6. If INF f then for infinitely many n there exists a language A n G L(PDA) such that 
(1) any PDA that recognizes A n requires size > fin), (2) there is an LB A of size < n that 
recognizes A n . 

Proof: We can apply Theorem 14.61 to all the relevant pairs since all of the premises needed are 

either obvious or well known. | 

Note 4.8 Since deterministic time classes are effectively closed under complementation we can 
also apply Theorem l4.6l to get a corollaries about any deterministic time class that contains L(PDA). 
Let 

oj = inf {a : Two n x n Boolean matrices can be multiplied in time 0(n Q )}. 

Le Gall [Q has the current best upper bound: uj < 2.3728639. We abuse notation by letting, for 
all a > 0, DTIME(n") be the set of all deterministic Turing machines that run in time 0(n a ). 
Valiant lITTIl showed that that, for all a > uj, L(PDA) C L(DTIME(?r Q )). If Boolean matrix 
multiplication really is in DTIME(n tJ ) then so is L(PDA). (Lee 0 showed that if L(PDA) C 
DTIME(n 3 ~ e ) then u < 3 — (e/3); therefore the problems of L(PDA) recognition and matrix 
multiplication are closely linked.) Hence, for all a > u (and possibly for uj also) we can obtain a 
corollary about DTIME(n Q ) that is similar to Corollary 14.7[ 

5 c-Bounding Functions for PDAs 
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Theorem 5.1 


1. If f is a c-bounding function for PDAs then HALT <t /. 

2. If f is a c-bounding function for PDAs then INF <t /. 

3. There exists a c-bounding function f <t INF for PDAs. (This is almost identical to the 
proof of Theorem \4~6\ 3 so we do not prove it.) 

4. If INF fj f then for infinitely many n there exists a language A n such that (I) A n . A n 6 
L(PDA), (2) there is no PDA of size < f{n) for A n , but (3) there is a PDA of size < nfor 
A n . (This follows from Part 2 so we do not prove it.) 

Proof: 


• Pi, P 2 ,..., is a size-enumeration of PDAs. 

• / is a c-bounding function for PDAs. 

• g (when on two variables) is the computable function such that ACC B}X is recognized by 

PDA Pg(e,x)- 

• g (when on one variable) is the computable function such that ACC e is recognized by PDA 

1) Let t = f(g(e,x)). 

(e, x ) 6 HALT iff (31 < a < t)[L(P a ) is an accepting computation of M e (x)]. 

Since both the non-emptiness problem and the membership problem for PDAs is decidable this 
condition can be checked. 

2) Let t = f(g(e)). 

e e INF =*► ACC e £ L(PDA) =*► ACC e £ {L{Pf),..., L(P t )} ==► 
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(3aq, ... ,x t )(Vl < i < t)[Pi(xi ) ^ ACC e (xi)\. 


e ^ INF =*► ACC e is finite =*► ACC e £ (L(Pi),..., L(P t )} ==► 

-■(3xi,... ,x t )(Vl < i < t)[Pi{xi) ^ ACC e (xi)\. 

We can now use / <t HALT to determine if (3x ±,..., x t )(Vl < i < t)[Pi(xi ) 7 ^ ACC e (xj)]. 

is true or not. | 

6 i-Bounding Functions for PDAs 

Def 6.1 We use the same conventions for Turing machines as in Definition [473] Let e.x e N. 

1. ODDACC £:X be the set of ali sequences of config’s represented by 


<ps~i c (Ps~iR(P 

d)0id)02 d)03d)04 d) * * * tDU s d) 

such that 

• |Ci| = |C 2 | and |C 3 | = |C 4 | and ... and |C s _i| = |C S |. 

• For aii odd i, C i+1 is the next config after C,. (We have no restriction on, say, how C 2 
and C 3 reiate. They couid even be of different iengths.) 

• C s represents an accepting config. 

2. Let ODDACC e = U. c6N ODDACC e , x . 

3. EVENACC £)X be the set of aii sequences of config’s represented by 


(P/^i (P S~lR(P S~1 (PS~lR(P (P/^lR(P 

d)Uid)02 d)03d)04 d) * * * d)O s d) 

such that 

• |C 2 | = |C 3 | and |C 4 | = |C 5 | and ... and |C S _ 2 | = |C s _i 
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• For all even i, C i+i is the next config after C t . (We have no restriction on, say, how 6' 3 
and C 4 relate. They could even be of different lengths. We also have no restriction on 
C\ except that it be a config.) 

4. Let EVENACC e = U* eN EVENACC CiX . 

Note that 

1. (e, x ) G HALT iff ODDACC ejX D EV EN ACC e . x contains only one string and that string 
is an accepting computation of M e (x). 

2. e G INF mODDACC e D EVEN ACC e £ L(PDA). 

Using these two facts you can prove the theorem below in a manner similar to the proof of 
Theorem 15.11 

Theorem 6.2 

1- If f is an i-bounding function for PDAs then HALT <t /. 

2. If f is an i-bounding function for PDAs then INF <t /. 

3. There exists an i-bounding function f <t INF for PDA. 

4. If INF f then for infinitely many n there exists languages A n l and A n 2 such that (1) 
A Ui i, A n2 G L(PDA), (2) there is no PDA of size < f(n) for A Ui i (T A nj2 , but (3) there is a 
PDA of size < nfor A n l D A n 2 . 

7 A Double-Exp For-Almost-All Result Via a Natural Language for (DPDA,PDA) 

We show that for almost all n there is a (natural) language A n such that A n has a small PDA but A n 
requires a large PDA. We then use this to show that for almost all n there is a language A n that has 
a small PDA but requires a large DPDA. Neither of these results is new. Harel and Hirst [[5j have 
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essentially proved everything in this section. We include this section because some of our proofs 
are different from theirs and because in most cases they do not explicitly state the theorems. After 
every statement and proof in this section we briefly discuss what they did. We denote their paper 
by HH. 

Lemma 7.1 Let X, Y. Z be nonterminals. Let E be a finite alphabet. 

1. For all n > 2 there is a PDA of size O (log n) that generates {Y n }. 

2. For all n > 2 there is a PDA of size O(logn) that generates {a, b} n . 

3. For all n > 2 there is a PDA of size 0 (log n) that generates 

4. For all n > 2 there is a PDA of size O(logn) that generates {a, b} 

5. For all n > 2 there is a PDA of size O ( log n) that generates { Y- n }. 

6. For all n > 2 there is a PDA of size 0 (log n) that generates {a, b} 

Proof: 

1) We show that there is a CFG of size < 2 lg n that generates { Y" } by induction on n. 

If n = 2 then the CFG for {YY } is 
S ->• YY 

which has 2 = 2 lg 2 nonterminals. 
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If n = 3 then the CFG for {YYY} is 
S ->• Y{Y 
Y -> LL 

which has 3 < 2 lg 3 nonterminals. 

Assume that for all m < n there is a CFG of size < 2 lg m for { Y r " }. We prove this for n. 

• n is even. Let G' be the CFG for { Y n / 2 } with the start symbol replaced by S'. The CFG G 

for {Y n } is the union of G" and the one rule S —* S'S'. This CFG has one more nonterminal 
than G'. Hence the number of nonterminals in G is < 2 lg(n/2) + 1 < 2 lg n 

• n is odd. Let G’ be the CFG for {y( n_1 V 2 } with the start symbol replaced by S'. The 
CFG G for {Y n } is the union of G' and the two rules S —»■ Y S" and S" —> S'S'. This 
CFG has two more nonterminals than G'. Hence the number of nonterminals in G is < 

21g((n — l)/2) + 2 < 21gn. 

2) Add the the productions Y —> a and Y —> b to the CFG from Part 1. 

3) Add the production Y —> e to the CFG from Part 1. 

4) Add the production Y —> e to the CFG from Part 2. 

5) Let G be the O(logn) sized CFG for { Y n } from Part 1. Let G' be the 0(1) sized CFG for Y*. 
The CFG for the concatenation of L(G ) and L{G') is an O(logn) sized CFG for {Y- n }. 

6 ) Add the productions Y —> a and Y —> b to the CFG from Part 5. | 

Note 7.2 Lemma im 1 follows from the first paragraph of the proof of Proposition 16 in the journal 
version of HH (Proposition 14 in the conference version). They used PDAs (which they call 0- 
PDAs) where as we use CFGs and then convert them to PDAs. 

Theorem 7.3 For almost all n there exists a (natural) language A n such that the following hold. 
1. A n ,AfeL( PDA). 


23 




2 . Any DPDA that recognizes A n requires size > 2 


2 ^( 71 .) 


3. There is a PDA of size 0(ri) that recognizes A n . 

Proof: We show there is a language A n such that (1) A n ,A n E L(PDA), (2) any PDA that 

recognizes A n requires size > 2 n ( n \ (3) there is a PDA of size O(logn) that recognizes A n . 
Rescaling this result yields the theorem. 

Let W n = {ww : |w| = n}. Let A n = W n . 

1) A n is cofinite, so both A n and A n are in L(PDA). 

2) Filmus [[2]| showed that any CFG for W n requires size > 2 n ( n \ Hence by Example 1 1.41 4 any 
PDA for W n = A n requires size > 2 n(n f 

3) We present a CFG for A n of size O(logn). By Example II .41 5 this suffices to obtain a PDA of 
size O(logn). We will freely use that Lemma I7TI yields CFG’s of size O(logn) by Example 1 1.41 4. 

Note that if x E A n then either \x\ < 2n — 1, x > 2n + 1, or there are two letters in x that are 
different and are exactly n — 1 apart. These sets are not disjoint. 

The CFG is the union of three CFGs. The first one generates all strings of length < 2 n — 1- By 
Lemma [7TI there is such a CFG of size O(logn). The second one generates all strings of length 
> 2n + 1. By Lemma [7TI there is such a CFG of size 0(log?i). 

The third one generates all strings of length > 2 n where there are two letters that are different 
and exactly n— 1 apart (some of these strings are also generated by the second CFG). By Lemma [7TI 
there is a CFG G' of size O(logn) that generates all strings of length n — 1. Let S' be its start 
symbol. G' will be part of our CFG G. 

Our CFG has start symbol S, all of the rules in G', and the following: 

S -E UaS'hU | UbS'aU 
U ^aU | bU | e 

The union of the three CFG’s clearly yields a CFG of size O(logn) for A n . | 
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Note 7.4 Theorem I7.3l is implicit in Proposition 27 of HH. They use the language 

£* — {tuSruStuSru : |w| = n}. 

We can now obtain a double exponential result about (DPDA,PDA). 

Theorem 7.5 For almost all n there exists a (natural) language A n such that the following hold. 

1. Any DPDA that recognizes A n requires size > 2' 2 ‘ h ' ! \ 

2. There is a PDA of size 0(n ) that recognizes A n . 

Proof: Let A n be as in Theorem 17 .31 (note that it is scaled). We already have that A n has a PDA 

of size 0(n). We show that any DPDA for A n is of size > 2 2fil?,) . Let P be an DPDA for A n . By 
Example 1 1 .81 2 there is a DPDA P' for A n of size 0(|P|). By Theorem 17.31 \P'\ > 2 2f2t,,)) , hence 
|P| > 2 2n(n) , | 

Note 7.6 Theorem 1731 is a special case of Corollary 30 of HH. 

8 A Double-Exp For-Almost-All Result Via a Natural Language for (PDA,LBA) 

We show that for almost all n there is a (natural) language A n that has a small LBA but requires a 
large PDA. 

Theorem 8.1 For almost all n there exists a (natural) language A n such that the following hold. 

1. Any PDA that recognizes A n requires size > 2 2n(n) . 

2. There is an LBA of size O( n) that recognizes A n . 
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Proof: We show there is a language A n such that (1) any PDA for A n requires size > 2 {l</n] and 

(2) there is an LBA of size O(logn) for A n . Rescaling this result yields the theorem. Let 

A n = {w$ui : |w| = n}. 

1) Filmus 0 showed that any CFG for A n requires size > 2 nin K Hence, by Example 1 1,41 4. any 
PDA for A n requires size > 2 ntn) . 

2) We present a CSG for W n of size O(logn). By Example 1 1 .41 7 this yields an LBA of size 
O(logn). 

Here is the CSG for {vj$w : \w\ = n}. 

S —> Y n W (actually use the CFG from Lemma 17711 of size O(logn) to achieve this) 

Y —>■ aA | bB 
Aa —> aA 
Ab ->• bA 
Ba —aB 
Bb —> bB 
AW -2 Wa 
BW ->• Wb 
W -> $ | 

9 A Ginormous For-Almost-All Result for (PDA,LBA) 

Meyer and Fisher lITTil say the following in their Further Results Section: 

... context-sensitive grammars may be arbitrarily more succinct than context-free grammars ... 

The reference given was a paper of Meyer [FTOil . That paper only refers to Turing Machines. 
We exchanged emails with Meyer about this and he informed us that his techniques could be used 
to obtain the result that is Theorem 19. ll below. Rather than work through his proof we provide our 
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own. Our proof is likely similar to his; however, we use the closure of L(LBA) under complemen¬ 
tation Ena which was not available to him at the time. 

We assume that all LBAs are modified so that, on input x, if a branch does not terminate in 
time then that branch will halt and reject. Hence every branch either halts and accepts or halts 
and rejects. 

Let P 1: P 2 ,... be a size-enumeration of all PDAs. We assume that P e is of size > e. We also 
have a list Ni, N 2 ,..., of LBAs such that L(iVj) = L(Pf) and (by the effective closure of L(LBA) 
under complementation) N[, N' 2 ,... such that L(N-) = L(Pj). Note that Ni,N 2 , ■ ■ ■ is not a list 
of all LBAs. 

The following will be key later: Let x e E* and imagine running Nfx), for each path noting 
if it said Y or N, and then running N'(x), and then noting if that path said Y or N. So each path 
ends up with a NN, NY, YN, or YY. 

• x £ L(Pi)\ some path says YN, some paths might say NN, but no path says NY or YY. 

• x ^ L(Pi ): some path says NY, some path might says NN, but no path says YN or NN. 

Note that TV; and N[ run in 0(|x|) space. 

Theorem 9.1 Let f <t HALT. For almost all n there exists a finite set A n such that the following 
hold. 

1. Any PDA that recognizes A n requires size > f(n). 

2. There is an LBA of size 0(n ) that recognizes A n . 

Proof: We construct the language A n by describing an LBA for it (really an NSPACE(|x|) 

algorithm). The idea is that A n will be diagonalized against all small PDAs. The algorithm will 
run in 0(|x|) space. We will comment on the constant in the 0(|x|) later. 

Since / < T HALT, by Fact 12.1 [ 6. there exists a computable g such that (Vn)[/(n) = 
lim^oo g(n, s)]. We can assume g(n, s ) can be computed in space 0(log(n + s)). 
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Fix n. We describe the algorithm for A n . The set we construct will satisfy the following 
requirements: 

For 1 < i < f(n) (which we do not know) 

Ri : A n ^ L(Pi). 

This is only a finite number of requirements; however, we do not know f(n). We will get 
around this by approximating f(n) via g(n, s ). 

The set A n will be a subset of a *. 

ALGORITHM for A n 

1. Input(a s ). 

2. Compute t = g(n, s ). 

3. Deterministically simulate A n on the strings {e, a, a 2 ,... , a lg * s }. Do not store what the 
results are; however, store which requirements indexed < t are satisfied. If so many were 
satisfied that you can’t store them in space < log s then reject and halt. 

4. If all of the requirements Pi as 1 < i < t are satisfied then reject and halt. 

5. (Otherwise) Let i be the least elements of (1,.., , t} such that Ri has not been seen to be 
satisfied. Run (nondeterministically) N t (x) and then N'(x'). Any path that yields NN outputs 
NO. There will be no paths that yields YY. Any path that yields NY output YES (this is 
diagonalization— a NY means that x L(Pi))- Any path that yields YN output NO (this is 
diagonalization— a YN means that x E L{Pi)). Requirements Ri is satisfied. 

END OF ALGORITHM for A n 

By the definition of g there exists s 0 such that, for all s , s' > s 0 , g(n, s ) = g(n , s 0 ) = f(n). 

We show, by induction on i, that for all i < f(n), Ri is satisfied. Assume that for all i' < i < 
f(n), Ri / is satisfied. Let si > s 0 be the least s such that all R v with i' < i are satisfied after A n (a s ) 
is determined. Let s 2 > s 1 be the least s such that for all inputs a- s the algorithm deterministically 
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simulates A Sl and hence notices that, for all %' < i, iG is satisfied. If Ri is satisfied on some input 
in a- S2 then we are done. Otherwise note that on input a s 2 the algorithm will notice that R, is not 
satisfied and satisfy it. 

How big is the LB A for A n ? The LB A only needs the parameter n and a constant number of 
instructions. Hence their is an LBA of size O(logn); however, we only need that there is an LBA 
of size 0(n). 

We show that the algorithm for A n is in NSPACE(0(|x|). Let g max = max{g(n, s ) : s G N}. 
Since lim^oo g(n, s ) exists g max is well defined. It depends on n but not on the input; hence g max 
is a constant. The first four steps of the algorithm take < lg*(|x|) + g mSLX space to execute. For 
large |x| this is far less than x j. Step 5 is the only nondeterministic step. Each branch is the result 
of running a branch of the NSPACE(0(| x |) machines Ni(x) and N'(x) where 1 < i < g max . 
Hence there is a constant c such that for all x each branch of the computation takes < c\x\ space. 
Therefore the algorithm for A n is in NSPACE (O (|x \) . | 

10 Open Problems 

We have pinned down the exact Turing degree of the bounding function for (DPDA,PDA) and 
(PDA,LB A). The exact Turing degree for the bounding functions for (DPDA,UCFG) and (UCFG,PDA) 
are open. 

We have obtained natural languages that show the (1) bounding function for (DPDA,PDA) and 
(PDA,LBA), and (2) the c-bounding function for PDAs, are at least double exponential. It is open 
to find natural languages that show a larger lower bounds. 
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